Hierarchical Clustering of Gene Expression Data
نویسندگان
چکیده
Rapid development of biological technologies generates a hug amount of data, which provides a processing and global view of the gene expression levels across different conditions and over multiple stages. Analyzation and interpretation of these massive data is a challenging task. One of the most important steps is to extract useful and rational fundamental patterns of gene expression inherent in these huge data. Clustering technology is one of the useful and popular methods to obtain these patterns. In this paper we propose a new hierarchical clustering algorithm to obtain gene expression patterns. This algorithm constructs a hierarchy from top to bottom based on a self-organizing tree. It dynamically finds the number of clusters at each level. We compare our algorithm with the traditional hierarchical agglomerative clustering (HAC) algorithm. We apply our algorithm to an existing 112 rat central nervous system gene expression data. We observe that our algorithm extracts patterns with different levels of abstraction. Furthermore, our approach is useful onion. Furthermore, our approach is useful on recognizing features in complex gene expression data.
منابع مشابه
به کارگیری روشهای خوشهبندی در ریزآرایه DNA
Background: Microarray DNA technology has paved the way for investigators to expressed thousands of genes in a short time. Analysis of this big amount of raw data includes normalization, clustering and classification. The present study surveys the application of clustering technique in microarray DNA analysis. Materials and methods: We analyzed data of Van’t Veer et al study dealing with BRCA1...
متن کاملModification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis
Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...
متن کاملبه کارگیری خوشهبندی دوبعدی با روش «زیرماتریسهای با میانگین- درایههای بزرگ» در دادههای بیان ژنی حاصل از ریزآرایههای DNA
Background and Objective: In recent years, DNA microarray technology has become a central tool in genomic research. Using this technology, which made it possible to simultaneously analyze expression levels for thousands of genes under different conditions, massive amounts of information will be obtained. While traditional clustering methods, such as hierarchical and K-means clustering have been...
متن کاملبررسی اثرات تغییر بیان ریز آر ان ای های سلولی ناشی از ویروس پاپیلوم انسانی در سلول های سرطانی سنگفرشی سر و گردن در سطح پروفیل بیان ژنی
Background and aim: Human Papilloma Virus plays an important role in some of human malignancies and causes alterations in normal expression levels of cellular microRNAs. In this paper, we evaluated the effects of such changes on Head and Neck Squamous Cell Carcinoma tumor samples at gene expression profile level. Methods: in this descriptive-analytical study, gene expression profiles of 36 tum...
متن کاملخوشهبندی دادههای بیانژنی توسط عدم تشابه جنگل تصادفی
Background: The clustering of gene expression data plays an important role in the diagnosis and treatment of cancer. These kinds of data are typically involve in a large number of variables (genes), in comparison with number of samples (patients). Many clustering methods have been built based on the dissimilarity among observations that are calculated by a distance function. As increa...
متن کاملGraph Clustering by Hierarchical Singular Value Decomposition with Selectable Range for Number of Clusters Members
Graphs have so many applications in real world problems. When we deal with huge volume of data, analyzing data is difficult or sometimes impossible. In big data problems, clustering data is a useful tool for data analysis. Singular value decomposition(SVD) is one of the best algorithms for clustering graph but we do not have any choice to select the number of clusters and the number of members ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003